On the acoustics of spread lips
نویسندگان
چکیده
In response to our incomplete understanding of how the 3-D geometry of the lips should be modeled in 2-D area functions, we present some measurements made on simplified physical models of the vocal tract. Our main result is that it is possible to match the formant pattern of a notched tube with that of an unnotched tube provided that an increment is added to the length of the unnotched tube. This increment was found to vary systematically with the depth of the notch and, to some extent, also with the size of the tube’s cross-sectional area. We expect these results to be relevant to representing articulations with the mouth corner posterior to the midsagittal anteriormost point of the vocal tract as in spread vowels, especially when emphatically and emotionally spoken. Determining the VT area function for spread lips In the acoustic theory of speech production the shape of the vocal tract (VT) is represented as a series of uniform cylindrical tubes, the so-called area function. The cross-sectional areas along the tract can be accurately predicted from information from 2-D lateral profile images of the midsagittal contours of the VT (Heinz & Stevens 1965, Sundberg et al 1987, Ericsdotter 2005). Figure 1. How do we determine the length and area of the lip section from an articulatory profile? What is the effect of the mouth corner on the formant pattern? While this method works well for the pharynx and the oral region, it is more difficult to apply to labial articulations. Standard theory represents the lips in terms of a single section having a certain length and opening area (Fant 1960). The problem addressed in this paper is the mapping of the 3-D geometry of the lips onto these two parameters (Figure 1). To shed some light on this topic we made acoustic measurements of two types of physical models: (i) A set of specially designed cylindrical pipes; (ii) a hybrid device, here referred to as HACMOD, whose front end is a 3-D replica of the anterior portion of the VT and whose posterior part consists of a pile of circular plexiglass washers with centered axial holes. This is work in progress. The cylinder measurements are the primary focus of this report. We expect to have more to say on HACMOD at the time of the meeting. Cylindical tubes with & without notches Our first investigation involved measuring the resonance characteristics of cylindrical cupper tubes with uniform cross-sectional areas. Three diameters were selected: 20, 28, 35 mm. This project was undertaken to provide baseline data for the more complex task of analyzing the acoustic properties of HACMOD. Figure 2 illustrates the design of these tubes. “Lip openings” and “mouth corners” were simulated by cutting and filing the tubes at one end so as to produce wedge-like notches. Use of paper templates glued to the tubes improved the precision of this procedure. The center columns of Figure 2 illustrate their geometry. The depth of the notches was varied between 0, 1, 2, 3, 4, and 5 cm. In the side view they produce straight lines; from the top or the bottom their contours produce a parabolic projection. Tubes without notches were ‘long’ and ‘short’. Short tubes ended at the mouth corner; Long tubes extended from source to the top edge. Figure 2. Cylindrical tubes used to study “spread lip” configurations. The formant frequencies of these resonators were measured by means of a manual frequency search. The sound source was the STL ionophone (Fransson & Jansson 1971) whose sine-wave output was varied in frequency to identify the resonance peaks. It was mounted on a tight fitting piston whose position in the tube could be continuously varied. Special care was taken to achieve an airtight seal between the ionophone rod and the tube. A microphone attached to an oscilloscope was mounted close to the open end of the tube. As a plasma is generated between the ionophone electrodes, heat is dissipated. The temperature of the air inside the tube was continuously monitored by two thermocouples, one placed near the tube opening; the other was inserted through the piston at 2 cm above the source, off-center. During a given session the average temperature inside the tube was about 30° C. To keep temperatures steady, air circulation was achieved by having a fan inject a laminar air stream into the tube. Cylindrical tube data The data set for the measurements were as follows: Overall tube lengths: 12 – 21 cm; Notch depths: 1, 2, 3, 4, 5 cm; For every notch depth, measurements were also made of two tubes without the notch: The full tube length (column 1 in Figure 2) and the short tube (column 4 in Figure 2). Figure 3 presents the plot format used to analyze the data. The temperatures recorded during the experiments were used to adjust all the frequency measurements so as to reflect a temperature of 35° C inside the tube. Figure 3. F1 and F2 measurements plotted against total tube length. Notch = 3 cm. The top line represents the formant value for the short tube [bottom-to-mouth corner]; The lower curve pertains to a full-length tube without a notch. The diagram describes the results for a temperature of 35° C inside the tube. All formants presented patterns similar to those of Figure 3 regardless of the geometric parameter values. Without exception the data for the notch conditions fell inside the margins set by the ‘long’ and ‘short’ tubes (columns 1 & 4 in Figure 2). It can be seen that F2:s data point for the 12 cm notched tube is missing in Figure 3. This is the case because at higher frequencies a resonance (usually F4, sometimes F3 and F2) would often exhibit a low Q value. Making an estimate of the peak in those cases was considered unreliable. Modeling the measured values Acoustic theory enables us to predict what the values for the tubes without notches ought to be. This provides us with an opportunity to check on the numerical accuracy of the measurements. Since the unnotched tubes have uniform cross-sectional areas, their resonance frequency would be expected to be given by: fn=(2n-1)*c/(4*(l+leff)) (1) where n is formant number, c speed of sound and leff is the effective length of the pipe. At 35° C the speed of sound in cm/sec is: c= 33145*SQRT(1+35/273.16) (2) We should note that the effective length of a pipe that is unflanged is different from the end correction used in VT calculations (Fant 1960:36). It is given by leff = l + 0.6*r (3) where r is the cylinder radius (Kinsler et al 2000:274). The measurements were found to deviate by less than 1.3 % from the calculated values. Equivalent lengths for notched tubes In analyzing the results for the notch condition we compared the observations with a set of equivalent unnotched tubes. We asked: Is it possible to equate the acoustic properties of a given notched tube with those of an unnotched tube whose length is chosen so as to match the unnotched formant pattern? For each measured formant value we calculated the following number: (35204/(4* )) ( 0.6* ) inc obs short l f l = − + r (4) where 35204 is the speed of sound at 35° C, fobs is the measured formant value, lshort is defined as the distance between the bottom end and the “mouth corner” of the notch. The assumption underlying the formula is that the notch will add an increment (linc) to the length of the unnotched tube (lshort + 0.6*r). Figure 4. Comparing the effect of notch depth in terms of the increment in length defined in Eq 4. When this increment is added to an unnotched tube whose effective length is [lshort + 0.6*r] a perfect match is obtained with the formant frequency observed for the notched condition. An example of how the length increments pattern is presented in Figure 4. It shows the effect on the length increment (linc) in a tube whose total length is plotted on the abscissa and whose radius is 1.4 cm. The notch depths (1 – 5 cm) are color-coded as shown on the right. Accordingly each colored cluster refers to a given notch depth. A positive finding is that formant number does not affect the increment much. Thus, formant measurements for individual conditions are pooled. The mean value of each cluster is indicated by a horizontal line. Figure 5. Summary of the data on cylindrical tubes with notches. The diagram shows the relationship between length increment, notch depth and tube radius. As the size of the notch increases the increment gets longer. A similar result was obtained for the other two cross-sectional areas. In Figure 5 the length increment is plotted against the size of the notch with the radius of the tubes as parameter (r=1.0, 1.4 and 1.75). For all cross-sectional areas the length increment increases with notch depth. Also to some extent the increment depends on tube radius. However the limitations of our method do not permit us to establish this trend with certainty. Can the formant patterns of notched tubes be described in terms of unnotched tubes of equivalent lengths? In conclusion, it would seem that we can give an affirmative answer to that question.
منابع مشابه
Proceedings of Meetings on Acoustics
The context of this study is the physical modeling of speech production. The first step of our approach is to realize in-vivo measurements during the production of the vowel-consonant-vowel sequence /apa/. This measurements concerns intra-oral pressure, acoustic pressure radiated at the lips and labial parameters (aperture and width of the lips) derived from a high-speed video recording of the ...
متن کاملA Collocation Method with Modified Equilibrium on Line Method for Imposition of Neumann and Robin Boundary Conditions in Acoustics (TECHNICAL NOTE)
A collocation method with the modified equilibrium on line method (ELM) forimposition of Neumann and Robin boundary conditions is presented for solving the two-dimensionalacoustical problems. In the modified ELM, the governing equations are integrated over the lines onthe Neumann (Robin) boundary instead of the Neumann (Robin) boundary condition equations. Inother words, integration domains are...
متن کاملWhich has a Greater Influence on Smile Esthetics Perception: Teeth or Lips?
Introduction: The aim of this study was to evaluate the role of teeth and lips in the perception of smile esthetics. Materials and Methods: Thirty women, ranging between 20 and 30 years of age, all with Class I canine and molar relationships and no history of orthodontic treatment, were chosen. Five black and white photographs were taken of each participant in a natural head position while s...
متن کاملInfluence of lips on the production of vowels based on finite element simulations and experiments.
Three-dimensional (3-D) numerical approaches for voice production are currently being investigated and developed. Radiation losses produced when sound waves emanate from the mouth aperture are one of the key aspects to be modeled. When doing so, the lips are usually removed from the vocal tract geometry in order to impose a radiation impedance on a closed cross-section, which speeds up the nume...
متن کاملIntroducing visual cues in acoustic-
The contribution of facial measures in a statistical acoustic-toarticulatory inversion has been investigated. The tongue contour was estimated using a linear estimation from either acoustics or acoustics and facial measures. Measures of the lateral movement of lip corners and the vertical movement of the upper and lower lip and the jaw gave a substantial improvement over the audio-only case. It...
متن کاملQuantitative association of orofacial and vocal-tract shapes
OROFACIAL AND VOCAL-TRACT SHAPES Hani Yehia1 Philip Rubin2 Eric Vatikiotis-Bateson1 [email protected] [email protected] [email protected] 1ATR Human Information Research Laboratories 2-2 Hikaridai, Seika-cho, Soraku-gun, Kyoto 619-02, Japan 2Haskins Laboratories 270 Crown St. New Haven{CT 06511, U.S.A. ABSTRACT This paper examines the degrees of correlation among vocal tract and orof...
متن کامل